Correction: Chromosomal-Level Assembly of the Asian Seabass Genome Using Long Sequence Reads and Multi-layered Scaffolding

Authors

  • Shubha Vij
  • Heiner Kuhl
  • Inna S. Kuznetsova
  • Aleksey Komissarov
  • Andrey A. Yurchenko
  • Peter Van Heusden
  • Siddharth Singh
  • Natascha M. Thevasagayam
  • Sai Rama Sridatta Prakki
  • Kathiresan Purushothaman
  • Jolly M. Saju
  • Junhui Jiang
  • Stanley Kimbung Mbandi
  • Mario Jonas
  • Amy Hin Yan Tong
  • Sarah Mwangi
  • Doreen Lau
  • Si Yan Ngoh
  • Woei Chang Liew
  • Xueyan Shen
  • Lawrence S. Hon
  • James P. Drake
  • Matthew Boitano
  • Richard Hall
  • Chen-Shan Chin
  • Ramkumar Lachumanan
  • Jonas Korlach
  • Vladimir Trifonov
  • Marsel Kabilov
  • Alexey Tupikin
  • Darrell Green
  • Simon Moxon
  • Tyler Garvin
  • Fritz J. Sedlazeck
  • Gregory W. Vurture
  • Gopikrishna Gopalapillai
  • Vinaya Kumar Katneni
  • Tansyn H. Noble
  • Vinod Scaria
  • Sridhar Sivasubbu
  • Dean R. Jerry
  • Stephen J. O'Brien
  • Michael C. Schatz
  • Tamás Dalmay
  • Stephen W. Turner
  • Si Lok
  • Alan Christoffels
  • László Orbán
Abstract

We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of the L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics.

Similar resources

Improved long read correction for de novo assembly using an FM-index

Long read sequencing is changing the landscape of genomic research, especially de novo assembly. Despite the high error rate inherent to long read technologies, increased read lengths dramatically improve the continuity and accuracy of genome assemblies. However, the cost and throughput of these technologies limit their application to complex genomes. One solution is to decrease the cost and t...

LINKS: Scaffolding genome assemblies with kilobase-long nanopore reads

Motivation: Owing to the complexity of the assembly problem, we do not yet have complete genome sequences. The difficulty in assembling reads into finished genomes is exacerbated by sequence repeats and the inability of short reads to capture sufficient genomic information to resolve those problematic regions. Established and emerging long read technologies show great promise in this regard, bu...

LINKS: Scalable, alignment-free scaffolding of draft genomes with long reads

BACKGROUND Owing to the complexity of the assembly problem, we do not yet have complete genome sequences. The difficulty in assembling reads into finished genomes is exacerbated by sequence repeats and the inability of short reads to capture sufficient genomic information to resolve those problematic regions. In this regard, established and emerging long read technologies show great promise, bu...

Evaluation and Validation of Assembling Corrected PacBio Long Reads for Microbial Genome Completion via Hybrid Approaches

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete mic...

ScaffMatch: scaffolding algorithm based on maximum weight matching

Motivation: Next-generation high-throughput sequencing has become a state-of-the-art technique in genome assembly. Scaffolding is one of the main stages of the assembly pipeline. During this stage, contigs assembled from the paired-end reads are merged into bigger chains called scaffolds. Because of a high level of statistical noise, chimeric reads, and genome repeats the problem of scaffolding...

Volume: 12

Publication date: 2016